Term Spotting: A Quick-and-dirty Method for Extracting Typological Features of Language from Grammatical Descriptions

Abstract

Starting from a large collection of digitized raw-text descriptions of languages of the world, we address the problem of extracting information of interest to linguists from these. We describe a general technique to extract properties of the described languages associated with a specific term. The technique is simple to implement, simple to explain, requires no training data or annotation, and requires no manual tuning of thresholds. The results are evaluated on a large gold standard database on classifiers with accuracy results that match or supersede human inter-coder agreement on similar tasks. Although accuracy is competitive, the method may still be enhanced by a more rigorous probabilistic background theory and usage of extant NLP tools for morphological variants, collocations and vector-space semantics.
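The abstract does not spell out the procedure, so the following is only a minimal sketch of what spotting a term in raw-text descriptions could look like, not the authors' actual method. It assumes a single-word term, whole-word matching, and a cutoff taken from the collection-wide average rate so that no threshold is tuned by hand. The function names `term_rate` and `spot_term` are hypothetical.

```python
import re
from collections import Counter


def term_rate(text: str, term: str) -> float:
    """Occurrences of a single-word `term` per token in `text` (case-insensitive)."""
    tokens = re.findall(r"[a-z]+", text.lower())
    if not tokens:
        return 0.0
    return Counter(tokens)[term.lower()] / len(tokens)


def spot_term(descriptions: dict[str, str], term: str) -> dict[str, bool]:
    """Flag each language whose description uses `term` at an above-average rate.

    Assumption: the cutoff is the mean rate over the whole collection, which is
    derived from the data rather than set manually. The paper may use a
    different rule.
    """
    rates = {lang: term_rate(text, term) for lang, text in descriptions.items()}
    cutoff = sum(rates.values()) / len(rates) if rates else 0.0
    return {lang: rate > cutoff for lang, rate in rates.items()}


if __name__ == "__main__":
    # Toy descriptions, invented purely for illustration.
    toy = {
        "A": "The language has numeral classifiers. Classifiers attach to nouns.",
        "B": "Nouns take case suffixes and there is no gender.",
    }
    print(spot_term(toy, "classifiers"))
```

This sketch ignores morphological variants (for example "classifier" versus "classifiers") and multi-word terms, which the abstract itself lists among possible enhancements.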

Similar resources

Grammatical Aspects for Language Descriptions

For the purposes of tool development, computer languages are usually described using context-free grammars with annotations such as semantic actions or pretty-printing instructions. These descriptions are processed by generators which automatically build software, e.g., parsers, pretty-printers and editing support. In many cases the annotations make grammars unreadable, and when generating code...

Introducing a method for extracting features from facial images based on applying transformations to features obtained from convolutional neural networks

In pattern recognition, features denote measurable characteristics of an observed phenomenon, and feature extraction is the procedure of measuring these characteristics. A set of features can be expressed as a feature vector, which is used as the input data of a system. An efficient feature extraction method can improve the performance of a machine learning system such as face recognit...

The effect of lexical and grammatical collocation instruction through input flooding versus awareness raising on short-term and delayed retention as well as active use

This study attempted to explore whether teaching English collocations through two different modes, awareness-raising and input flooding, has any differential effect on immediate retention as well as on retention in a delayed assessment. It also compared the possible differential effect of teaching English collocations implicitly and explicitly on actively using the items in writing. m...

How Dirty are "Quick and Dirty" Methods of Project Appraisal?

Routine "quick-and-dirty" (QD) methods of project appraisal can be so dirty in guiding project selection as to wipe out the net social gains from public investment. A common QD method for estimating benefits from irrigation investments is tested using data for Viet Nam. The results are compared to impacts assessed through econometric modelling of marginal returns that allows for household and a...

SL: a "quick and dirty" but working intermediate language for SVP systems

The CSA group at the University of Amsterdam has developed SVP, a framework to manage and program many-core and hardware multithreaded processors. In this article, we introduce the intermediate language SL, a common vehicle to program SVP platforms. SL is designed as an extension to the standard C language (ISO C99/C11). It includes primitive constructs to bulk create threads, bulk synchronize ...

Journal

Journal title: Linköping electronic conference proceedings

Year: 2021

ISSN: 1650-3740, 1650-3686

DOI: https://doi.org/10.3384/ecp184172